PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Gh_D03G0831
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 666aa    MW: 73263.2 Da    PI: 8.3096
Description Trihelix family protein
Gene Model
Gene Model ID | Type | Source
Gh_D03G0831 | genome | NAU-NBI
Signature Domain
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End
1 | trihelix | 95.1 | 6.5e-30 | 93 | 177 | 1 | 87
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                  rW++qe+laL+++r++m+  +r++  k+plWe+vs+k++e g++rs+k+Ckek+en++k+yk++k+g+++r++++s  +++f++lea
  Gh_D03G0831  93 RWPRQETLALLKIRSDMDGIFRDATVKGPLWEDVSRKLAELGYKRSAKKCKEKFENVHKYYKRTKDGRGGRQDGKS--YKFFSELEA 177
                  8*********************************************************************866665..******985 PP

2 | trihelix | 110.1 | 1.4e-34 | 489 | 574 | 1 | 87
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                  rW+k evlaLi++r+ +e+r++++  k+plWee+s+ m++ g++rs+k+Ckekwen+nk++kk+ke++kkr +e+ +tcpyf+ql+a
  Gh_D03G0831 489 RWPKAEVLALINLRSGLETRYQEAGPKGPLWEEISAGMSRMGYKRSAKRCKEKWENINKYFKKVKESNKKR-PEDAKTCPYFHQLDA 574
                  8*********************************************************************8.99999********85 PP
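
As a rough illustration of how the alignment rows relate to each other, the sketch below counts identical columns between the HMM consensus row and the Gh_D03G0831 row of the second domain. It assumes that a match means case-insensitive equality and that '-' or '.' marks a gap column; the function name `identity` is hypothetical and is not part of PlantTFDB or of the tool that produced the alignment.

```python
# Rows copied from the second trihelix domain alignment in this record.
CONSENSUS = (
    "rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr"
    "tsessstcpyfdqlea"
)
TARGET = (
    "RWPKAEVLALINLRSGLETRYQEAGPKGPLWEEISAGMSRMGYKRSAKRCKEKWENINKYFKKVKESNKKR"
    "-PEDAKTCPYFHQLDA"
)


def identity(consensus: str, target: str, gaps: str = "-.") -> tuple[int, int]:
    """Return (identical columns, aligned non-gap columns).

    Assumption: identity is case-insensitive equality of the two rows.
    """
    if len(consensus) != len(target):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for c, t in zip(consensus, target):
        if c in gaps or t in gaps:
            continue  # skip columns where either row has a gap
        aligned += 1
        if c.upper() == t.upper():
            identical += 1
    return identical, aligned


identical, aligned = identity(CONSENSUS, TARGET)
print(f"{identical}/{aligned} identical columns ({identical / aligned:.1%})")
```

The same function can be applied to the first domain by pasting in its two rows; the gap characters in the target row are skipped rather than counted as mismatches.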

Protein Features
Database | Entry ID | E-value | Start | End | InterPro ID | Description
SMART | SM00717 | 0.016 | 90 | 152 | IPR001005 | SANT/Myb domain
PROSITE profile | PS50090 | 6.597 | 92 | 150 | IPR017877 | Myb-like domain
Pfam | PF13837 | 7.9E-21 | 92 | 178 | No hit | No description
CDD | cd12203 | 6.69E-26 | 92 | 157 | No hit | No description
SMART | SM00717 | 0.0083 | 486 | 548 | IPR001005 | SANT/Myb domain
PROSITE profile | PS50090 | 6.748 | 488 | 546 | IPR017877 | Myb-like domain
Pfam | PF13837 | 8.8E-24 | 488 | 575 | No hit | No description
CDD | cd12203 | 6.21E-29 | 488 | 553 | No hit | No description
Sequence
Protein Sequence    Length: 666 aa
MQQGGGEGHQ SQYGEVGGGP TTDATSSSHM VSEQSEQLEE ASPISYRPPA AAIGNPDELM  60
MRLAEEGDEG DRLGGDHGGG VGGGAGGVAS GNRWPRQETL ALLKIRSDMD GIFRDATVKG  120
PLWEDVSRKL AELGYKRSAK KCKEKFENVH KYYKRTKDGR GGRQDGKSYK FFSELEALNT  180
TSVTLSKPPI TLATSASLDV APISVGIPMP ISSVWIPPTT TTTTTAIPMS SSMLPMPGSA  240
PPPPPATPFG ISFSSNSSSS SQGFEDEDEI GREPSTDMGG SSRKRKRQSS SREGCSSSSS  300
RKRMMEFFEG LMKQVMQKQE ALQQTFLESI EKREQDRMIR EEAWKRQEMA RLAREHELIA  360
QERAIASSRD AAIISFLQKI TGQTIQLPTT VSTIPSVPPP PTQPATPVVQ PPTPIPTAAP  420
PLHHPPSLPQ QKSHLHHQQQ QQAQNTQLVV KHNQQQEPIP SEVIMAIPEQ KVPPQEIGGS  480
EGIKPASSRW PKAEVLALIN LRSGLETRYQ EAGPKGPLWE EISAGMSRMG YKRSAKRCKE  540
KWENINKYFK KVKESNKKRP EDAKTCPYFH QLDALYRKKI LGSGSSSFSD QNRFEGETSQ  600
QHQDPPMEAP QHSHDQSENK TGTTIDVLTS KENSPGSLFG KGNGRATKKS EDIVREQMEE  660
QEMQMQ
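
The sequence block above is laid out in groups of ten residues with a running position counter at the end of each full line. A minimal sketch, assuming that layout, strips the spacing and counters, checks the stated length, and slices the two signature-domain regions by their 1-based inclusive coordinates. The helper names `clean_sequence` and `region` are hypothetical.

```python
import re

# Sequence block copied from this record; spaces and trailing
# position counters are layout only.
RAW = """
MQQGGGEGHQ SQYGEVGGGP TTDATSSSHM VSEQSEQLEE ASPISYRPPA AAIGNPDELM  60
MRLAEEGDEG DRLGGDHGGG VGGGAGGVAS GNRWPRQETL ALLKIRSDMD GIFRDATVKG  120
PLWEDVSRKL AELGYKRSAK KCKEKFENVH KYYKRTKDGR GGRQDGKSYK FFSELEALNT  180
TSVTLSKPPI TLATSASLDV APISVGIPMP ISSVWIPPTT TTTTTAIPMS SSMLPMPGSA  240
PPPPPATPFG ISFSSNSSSS SQGFEDEDEI GREPSTDMGG SSRKRKRQSS SREGCSSSSS  300
RKRMMEFFEG LMKQVMQKQE ALQQTFLESI EKREQDRMIR EEAWKRQEMA RLAREHELIA  360
QERAIASSRD AAIISFLQKI TGQTIQLPTT VSTIPSVPPP PTQPATPVVQ PPTPIPTAAP  420
PLHHPPSLPQ QKSHLHHQQQ QQAQNTQLVV KHNQQQEPIP SEVIMAIPEQ KVPPQEIGGS  480
EGIKPASSRW PKAEVLALIN LRSGLETRYQ EAGPKGPLWE EISAGMSRMG YKRSAKRCKE  540
KWENINKYFK KVKESNKKRP EDAKTCPYFH QLDALYRKKI LGSGSSSFSD QNRFEGETSQ  600
QHQDPPMEAP QHSHDQSENK TGTTIDVLTS KENSPGSLFG KGNGRATKKS EDIVREQMEE  660
QEMQMQ
"""


def clean_sequence(raw: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)
print(len(seq))                 # compare with the stated length of 666 aa
print(region(seq, 93, 177))     # first trihelix domain
print(region(seq, 489, 574))    # second trihelix domain
```

The slices can be compared by eye with the Gh_D03G0831 rows of the domain alignments, which show the same residues with gap characters inserted.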
Nuclear Localization Signal
NLS
No. | Start | End | Sequence
1 | 135 | 143 | KRSAKKCKE
Expression -- UniGene
UniGene ID | E-value | Expressed in
Ghi.18774 | 0.0 | boll
Annotation -- Protein
Source | Hit ID | E-value | Description
Refseq | XP_012470321.1 | 0.0 | PREDICTED: trihelix transcription factor GTL1
TrEMBL | A0A0D2QGT9 | 0.0 | A0A0D2QGT9_G
STRING | Sb01g049740.1 | 1e-157 | (Sorghum bicolor)
Orthologous Group
Lineage | Orthologous Group ID | Taxa Number | Gene Number
Malvids | OGEM6226 | 27 | 46
Best hit in Arabidopsis thaliana
Hit ID | E-value | Description
AT1G33240.1 | 2e-52 | GT-2-like 1